DOMAIN DATABASE KNOWLEDGE Incompleteness
Authors
Abstract
There are several different ways data mining (the automatic induction of knowledge from data) can be applied to the problem of natural language processing. In the past, data mining techniques have mainly been used in linguistic engineering applications to solve knowledge acquisition bottlenecks. In this paper, we show that they can also assist in linguistic theory formation by providing a new tool for the evaluation of linguistic hypotheses, for the extraction of rules from corpora, and for the discovery of useful linguistic categories. Applying Quinlan's C4.5 inductive machine learning method to a particular linguistic task (diminutive formation in Dutch), we show that data mining techniques can be used (i) to test linguistic hypotheses about this process, and (ii) to discover interesting linguistic rules and categories.
Similar resources
Planning in Incomplete Domains
Engineering complete planning domain descriptions is often very costly because of human error or lack of domain knowledge. While many have studied knowledge acquisition, relatively few have studied the synthesis of plans when the domain model is incomplete (i.e., actions have incomplete preconditions or effects). Prior work has evaluated the correctness of plans synthesized by disregarding such...
DOMAIN DATABASE KNOWLEDGE Incompleteness Noise
There are several different ways data mining (the automatic induction of knowledge from data) can be applied to the problem of natural language processing. In the past, data mining techniques have mainly been used in linguistic engineering applications to solve knowledge acquisition bottlenecks. In this paper, we show that they can also assist in linguistic theory formation by providing a new tool for...
A FOIL-Like Method for Learning under Incompleteness and Vagueness
Incompleteness and vagueness are inherent properties of knowledge in several real-world domains and are particularly pervasive in those domains where entities could be better described in natural language. In order to deal with incomplete and vague structured knowledge, several fuzzy extensions of Description Logics (DLs) have been proposed in the literature. In this paper, we present a novel F...
Utilizing Goal-Directed Data Mining For Incompleteness Repair In Knowledge Bases
In this paper we present a methodology for goal-directed data mining of association rules and incorporation of these rules into a probabilistic knowledge base. The purpose of the data mining and rule extraction process is to repair knowledge base incompleteness uncovered during validation. We discuss how this incompleteness is uncovered and show the fundamental forms this incompleteness can tak...
for u kit i
KNACK is a knowledge acquisition tool that generates expert systems for evaluating designs of electromechanical systems. An important feature of KNACK is that it acquires knowledge from domain experts without presupposing knowledge engineering skills on their part. This is achieved by incorporating general knowledge about evaluation tasks in KNACK. Using that knowledge, KNACK builds a conceptua...
Publication date: 1995